Domain mining for machine translation

نویسندگان
چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Domain mining for machine translation

Massive amounts of data for data mining consist of natural language data. A challenge in natural language is to translate the data into a particular language. Machine translation can do the translation automatically. However, the models trained on data from a domain tend to perform poorly for different domains. One way to resolve this issue is to train domain adaptation translation and language...

متن کامل

Domain Adaptation for Machine Translation by Mining Unseen Words

We show that unseen words account for a large part of the translation error when moving to new domains. Using an extension of a recent approach to mining translations from comparable corpora (Haghighi et al., 2008), we are able to find translations for otherwise OOV terms. We show several approaches to integrating such translations into a phrasebased translation system, yielding consistent impr...

متن کامل

Domain Modeling for Machine Translation

In this paper we present the domain model used in the VERBMOBIL [10] project, a project concerned with face-to-face dialogue interpreting funded by the German Ministry of Research and Technology. The current version of the domain model, which is used in the VERBMOBIL demonstrator, has been implemented in the Description Logic (DL) system FLEX and contains approximately 300 concepts and 170 role...

متن کامل

Robust machine translation for multi-domain tasks

In this thesis, we investigate and extend the phrase-based approach to statistical machine translation. Due to improved concepts and algorithms, the quality of the generated translation hypotheses has been significantly improved in recent years. Still, the translation quality leaves a lot to be desired when going beyond traditional translation tasks, such as newswire articles, and when addressi...

متن کامل

Effective Domain Mixing for Neural Machine Translation

Neural Machine Translation (NMT) models are often trained on heterogeneous mixtures of domains, from news to parliamentary proceedings, each with unique distributions and language. In this work we show that training NMT systems on naively mixed data can degrade performance versus models fit to each constituent domain. We demonstrate that this problem can be circumvented, and propose three model...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Intelligent & Fuzzy Systems

سال: 2015

ISSN: 1064-1246,1875-8967

DOI: 10.3233/ifs-151981